-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
✨ Support taxa="plants"
in EnsemblGene
#153
Conversation
Could you check if this is the correct db? bionty/bionty/base/entities/_gene.py Line 104 in e75518c
I think for plants it's a different db. |
I think it should be the same database (although it is not working). See https://plants.ensembl.org/info/data/mysql.html
and
But I can't locate, for example, a database such as
Which should be there as it is a model organism.. |
Could you try |
also an |
This is particularly odd: loading from
but not for items in
|
Found the fix. Needed to add the right port for some reason:
It seems the plant and the vertebrates databases use different ports. I will alter this PR to include a check. |
What about using |
kindeom=plants
in EnsemblGene
kindeom=plants
in EnsemblGene
kindeom=plants
in EnsemblGene
kindeom=plants
in EnsemblGene
kindeom=plants
in EnsemblGene
kindeom=plants
in EnsemblGene
kindeom="plants"
in EnsemblGene
kindeom="plants"
in EnsemblGene
kindom="plants"
in EnsemblGene
I think this is still incomplete, actually--loading in the downloaded Ensembl plant genes doesn't work because the data frames don't have the |
@mossjacob I'll look into this. I'll report back |
Ah, this is similar to the yeast case, could you take a look here? https://bionty-assets-gczz.netlify.app/ingest/gene-ensembl-release-112#saccharomyces-cerevisiae |
Thanks, I took a look at that notebook. I think it is the same as that example, and I get the same output ("no ensembl_gene_id found, writing to table_id column."), but then when I try to run:
it looks for the field
|
Signed-off-by: zethson <[email protected]>
kindom="plants"
in EnsemblGene
kingdom="plants"
in EnsemblGene
Signed-off-by: zethson <[email protected]>
Dear @mossjacob, sorry, I'm still catching up.
After lunch, I will look into your last issue. I'll report back! |
Signed-off-by: zethson <[email protected]>
kingdom="plants"
in EnsemblGene
taxa="plants"
in EnsemblGene
Concerning
@sunnyosun made me aware that this is a current limitation of
Does this help you? Edit: Sorry for closing - I fat fingered the wrong button. |
Hi! Thank you for this!
which I know is not ideal! |
It's super cool that you figured this out even though _ontology_id_field is not user-facing at all! Then no need to try |
I enjoy a good debug :) |
Okay so apparently Pandas 2.2 is not compatible with sqlalchemy 1.4 which I still had on my PC. I reverted the changes now that I made earlier to the SQL statements that fixed that. I'll make the CI run on this PR soon and then we can consider merging this. Would you like us to also add some plant organisms genes such as arabidopsis thaliana to Bionty so that it works out of the box for you? |
Signed-off-by: zethson <[email protected]>
sqlalchemy < 2 is no good to use anymore! 😇 😆 |
Impressively low-level contributions! @mossjacob 😄 |
Thanks everyone. Re adding to bionty-assets, while that would be nice, I envisage using quite a few different species so adding all to bionty-assets may be overkill at this point? In this PR there's a
With the change I made in this other PR, the URL to the local parquet file created can be added to [edit] the code has been updated |
Slightly altered the way gene tables are onboarded: the check for the |
Signed-off-by: zethson <[email protected]>
Signed-off-by: zethson <[email protected]>
Signed-off-by: zethson <[email protected]>
Great @mossjacob! Thank you very much for your enthusiasm and contributions.
I am ready to merge the PR now unless you want to keep building here? We'll also merge your sister PR for local |
Hi @Zethson , I am also ready for this to be merged in now! I still have to write a test for the local parquet file PR though. |
Some use cases, for example adding plants, requires adding a keyword argument specifying the kingdom in
EnsemblGene
.